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BANDWEDTH SAVINGS AND QoS IMPROVEMENT FOR WWW SITES 
BY CACHWO STAl IC AND DYNAMIC CONTEKI ON A DISTRIBUTED NCTWORK 

oFCAanss 

5 TECHNICAL YIEL D Qf THj& ]^^A^RNTIO^^ 

Tha invcoucin rekit« it> the storage and Irttnpuct of jnfbmialion oo ao loteraec mxt, more 
pailietilariy, to cadiing Web-p9£9 content on the Inleimt 

}0 This is a nmtmvftition-hvpari of comnion)}'-(nvned» copending U.S. ProvJsiunal Patent 
Applicalioii No. ^/2I8^59, filed 17 }uJy 2000. 

DACKG RO UND OFTITC WVmnON 

When two or mwc computers are connected so tbot they can shruc rc5(Mm:e5, this Is refened 
IS to 08 a "ndwwi*. Wbwi two or more networks ore connected togctlicr, thU is fefeneO lo aa 
on "tntentet" (icwcr ca$e ?). The "jntemel" (upper cose 'F) is u a global. intentet, whjch is a 
phiiali^ of intcT'Connected networks, all using o coimniMi protocol linuwn a$ TCP/IP 
(Tmnsmission Cnnirol Prolocol/Iidemei IVotocol)L 

2fl The World Wide Web ("WWW*, ot simply "Web") h n act of slandnrds and pyolecols that 
enable the exch&iigc of informnlion between compulcn, more panicidaily hypertext QiVW) 
Kcrvers, on the Internet, tying tbein together into a vast collection of interactive muliimedia 
resources. Traflic across the Internet pafses ibmugh Tnnliiplc "bsKkbono" cartieis. such ns 
UUNci or Digcx, whkh forward the signaU to one aoothcr under » fyttem c^led peerijig. 

Z5 

"(^eility of .^vcrvice" (QoS) is a mBasme of the spewl and relbbility with which Information 
can be iranj^rted tmm » cootisnt source (typically » server, or mi Internet Service Provider, 
1SP> to a user's compttter (upon which is typically runniug a type of "brovrsei" sofhrare tiiat 
is used to view cofltent). "Bandwidth" is a n^rastiremait of Ore volume ofinfunnnlion that 
)Q can bo tianaioitted over a ucn^'orlc (or a ^ven portion thereof) at a given time. The higher tlic 
b.-q}dwidtbk the more data cas be transported. Lateocy is a measure of the time ijiat it lakes 
for u packet of data to tr4rvei:se tlw Intemci, front Kource to user. 

Baodwidth can be tncit:osocl by insttOfing new copHiO naio»rce3» but increasing bandwidth 
3S only indiiectly addressee latency, and increnmng bandwidth cannot itcceleratc ovcrloitded or 
slow origin servers. Other solutions are needed to ovcrctmc these bHrners. One sohilim, 
caching content, is discussed at tength in an article c»ii|led "Nctw-orX Gtchiog Gokic, 
Oplimizing Web Comcnt Delivery", by Michael A. Oouldc, March ] 999, which i$ available 
online (via the Internet) at tlic foilowing addrc^;: 
40 Wlp:/Awww,ud^tomi.com/prod^Jcts^c^worlE/tech_resouTces/'tech^cachingg^ 
A text version of the article is available online at the following address: 

hilp://w\vvv.gpoglexoni^search?q^ache:www.ijiktonii.coni/products/nelwock/ 
techj«sotuces/tech^achinggiikle.pdf 

45 As noted ib the Uoulde article: 



"Ooe way to improve perfornstnce and tnnkc it more prcO(Ct.<)blc is to 
tninimt7.e th« varinbility introduced by Inng trips across the Internet firom the 
contcttt sowcc lo ihe tisei's broxv^ter. By stonng irequcntly accessed content nt 
50 . a location closer to the user, a great deal of laleocy and unpredictable delay io 
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iba Iffteroet eoD be dinbalecL Tbe tcctmique for iloioig tins is enJIed cxhing. 
Caching is a means vf storing content ot>iecis Ttwn a Web :terver closer Ui the 
usei, wJierc they can be relrievcd more qiricMy. the storehouse of objects; 
iiichJding text pases, iniagcs, and ollwr content, is caUed a Web cschc 
5 (pronounced cash)." f ^o<i](}<;^ page 2) 

Caching is an uppivach Ibat imptv«cs Qo3 for iuer$ ^vhiJe improving Ihe 
economics fw network service pivviders. Caching cntaih lAwni Jtrequcntly 
Bccc«tei) Web content closer lo usent, tbcr^ reducing the nuniber of hO|is 
10 th»l must be tmversMl in order retrieve ooniem thai resides on a remote site. 

( Qt)u>de> page 8) 

■ In operation, requests lor content originate from a user's browser and arc first 
directed to the caching server. If the content i* cujTcntJy contained in the cucbe 
15 ond Is up to date, the content is sent to the usa without the browser 's request 

bavina to be sent on lo the oriipnating server. TOs can re<Kice 0»e Joteoc> for 
the IransTcr of content and redwe the amount of ttjue it takes to traosfer the 
content to the ustr. It also rethiues the amount of data the service provider has 
to retrieve upstiwun from tbe Jntcruei to fiiMrll ceqacsW. ((louldc page 8) 

20 

Caching is a conoqM thai is wcB understood in computer hnrdwure design. 
Modem microproccsson employ on-cbip caches of hifh-speed nxiafxy lo 
store frequenlly usetl instructions and dula locally instead of havhiK lo rend 
them frcND slower memory. Computent implciiKnt aeverri addhJonaf Icvdit of 
2S cacliing. inchiding RAM cache and even on-disk cache, all designed to reduce 

tlx latency fix reading instntctiom and dtita nnd speeding up the transfer of 
data within the .'(}'stem. The ha^ principle behind caching is lo store 
frequently accessed data *m a loestion that con be accessed more quickly than 
)i couM from the data's more permanan location. In the case of a system's 
CPU., the pemvirtcnt locntion is the dj.*. On the Web, the permanent location 
is the origin server jsomcvv-hcrc out on the Intcmcl. fOoulJe . page R-9) 

Netwrk cachmg. wrWie a newer coikcepi, is based on exactly the same 
principle and has (be same benefit of cost-efl^vcJy improving the user 
J5 experience. (Gookte. jiagp 3) 

-Caching provides direct benctHs to the end user in terms o/ nrthiced I.-itcncy 
of Web piig« Oownload <lhe tbue you have lo vrait before anything $tarl< to 
happen) and faster download speeds (the time il tjikts for ad \[ic downloading 

41) to finish). But caching al.w provides benetiis (o network scrvica provider. By 

storing, or caching, Web content that veers reqxjest from remote servers 
lociitly. fewer of these i«|uebts hiwc to be sent o»'t over the lotemct t(s be 
fuliiUed. As a result, tbe access piuvider maintaining a cache hns its tip^drcam 
bandwidth requirements reduced. This reduces tho bandwidth the ser^')ce 

4:i provider has to purchase in on5cr lo provide its customers %vilb a satisfactory 

Web experience. (Ooulde. pa^tc 3) 

"dching also provides several benefits in a Web hosting cnvironmeot in 
uMci) an tuxcss proviiler hosb Wdl} sites lor many customers. Ir can reduce 
so the impact of traffic Rptkcs caused by high-iotcresl content and also serve as 
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the bsBU for a variety of value-added service^ cuch ts pnwidins vdditianal 
cnpocily, guaranteed sendee levels, and replication teraces. (Sfiuld& PHO 3) 

"[Caching] netwiMk data makes h possible lo provide useis vritb m optimal 
experience and prcdiviuUe icspunsc liiacs. Witli typical cjche hit rates, the 
nsCT experience has u hijjbci quality of .-^rvice (QoS). Imprpved QoS provides 
n^^ncimt benefits to ISPs, conteiif providers, ftiid corporate shea. Bcucr QoS 
rcfiilts in higher mmofnei- k)>'&liy and rcicntton. ll helps create o sironscr 
bnuul equity, both for the access provider aixl for content providers. Content 
providers who ensure tliat their conreni is amply cuched ihrotighunt rhc 
oetwoik , close to u$ers» will ultimately see more visitors Accessing tbetr 
content. fOoulde. page S) 

"Web browsers nlso implemeot n fonn of caching, storing rerenily accesscil 
Web content on Ihe xw fs luircJ <HsVc and reading from \hal locnl cnche ftt" files 
instead of acce&^ng the Internet. This worfcs \veQ when the user hits the 
"Bnck" and "Forward" bulluns m tlicir brotvscr during a session, hut it docs 
nothing ifilic uses is biowsins a site for the first tinw. fOouldc . page 9} 

"Neither the browser's cache nor a Web server's cnchc can Address network 
performarxic issues. By placing a cnche of Web content on «he network 
between the user and ibe oriBmating Web sites, the distance \hat ctnncnon^ 
accessed conlnit has tQ Havel over the Internet is reduced, and users 
experience quicker re^pmse acd lEastcr peifonniuice. Network caching inkes 
advsnl^ of the fact that some conieni is accessed more fie(|TKDlIy that oUier 
content. i)y implemeoting a caching sohitioni network service pfuvidcts can 
provide » betier Web ejcperienue to their cu.<itomcrs vvhilc reducing the total 
rtqeircmcnt far high-bandwidth ccmtcctioDt to the InlciDct. tCooldc. page 9) 

"When a caching solution Is i>ot in place, requests for cunteat from a browser 
and the content delivered fixwn Ibe origin server miisl repeatedly traverse a 
complete trip ficow tbe je^oesfiog conip^rtcr to ibe computer that ius the 
i-ontcut. Tbe Web browser sends a tequ«l for a Uniform Resource Localw 
(URL) thai rcfcis to a spedfic Web page on a particuiar server on tbe IntemeL 
Tlie request is touted to the server through the nonnol TCP/IP Jtdwork 
liBna^iOil. I'he content requested fioea the Web server (also known as an HTI P 
server) may be a static Hll/SL page with links to one or niote additional ines, 
incloding graphics. The content may b1.«i be a dynamically created page that is 
generated from a search engine, a database query, or a Web application. The 
HTTP SCTVCT returns Ihe requested coritcnt to the Web browser one jQ]c at a 
time. Even a dyniunically created page often has static compoijcots thAl are 
combined m(h the dynamic content lo create the fin»l page. (Goulda page 
13) 

"When caching is used, frcquaitly acccsscti content is stored close to the user. 
It may be slorcd inside a firewall ou the ascrt corporiile LAN, at the user's 
ISP, or at itome other Network Access Point (NAP) or Point of Presence (POP) 
lucaled closer to the user (baa the tmjtnily of Web serve». If tlic requested 
content, or fikt, is locxied in Ibe cache and Is current, or fresh, that is 
considered a cache hit. The more Crcquent.ly ^iser requests can he sensed from 
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fhecccKl»s.1})«lu8lx7 the hit late, thebdter Uk user '9 perfncntanoc» snd the 
less data iht service provider bos to retrieve fnMn Ibc Inlcnict on behalf of tf» 

iKcr. (GouMc. page M) 

5 "Similar issues cxisl for V\T file transfers, with nn FTP server hantHtng each 

request for u file svbniitted ty Uio ^TP clhajt applkaiioa DcJays n\d 
bottlenecks cnn be u biggrt problem wnh FTP because (he D'pical size of a liJc 
transfcired by FTP is laijcr ihon a typicaJ JfTML file. Slreamiug sudto and 
vl<teo an; Mlditioral examples of on Iftleroet apitlicfttion lhat can benefit fn»n 

Ift rjirliing cflfticirt close tp clul iom. ^Ooolde. pagfl 14) 

*14c!vrork caching can be applied to content delivered crver many diffoeDt 
protocols. These jncliKfe iriTP. NhTTP. KfP, RTSP. and otbere. AB ere 
duractcrized by fcuving some proportion of static content and hi^ tiltlizafion. , 
15 Csdic stiver njpport for each protocol Is, of cderse. lequlrod.'* (page 1 4) 

"Since the oiclic mmi be kept fresh, there will slid be traffic from the JSP ont 
10 the biicmcf, even if every bit of content yc<iucstod by users is to n cnchc 
Pai^c frailness has to be assmeU, and new content matt be downloaded. But 

20 Vy vsin^ caching, btUKhvidih utilrTalicm can be optbnized. It is even possible to 

n$e a cache with dynamic content, since even these pn^'es hirvT some static 
content that cm be served from a cache. Depatdiog on the distribution of 
tnfiie and the scabbility cf the cache, up to 40 percent -of user KHP leipjcsts 
USD be tidini off the network and served from a cttche. This males netvwks 

75 more cfficicrt, enablbig better pcrfomuince to be oCfeied at kmer cost." 

(£MI.&PRSel4) 

Tbc Go^tld article discusser t^e effective ysc of cachcs> load balancing, when: lo locate 
cliches in the infrasfnicnjrc, ^nA iresignijig cnchc-friendly web content. There b rIjo 
30 mention of protocols which have been developed - for example, Web Cache Contiot Protocol 
(WCCP) {p^ I «). Jhwe is also discujufion of appropriate use of the "expire" and "max- 
aec* headers in the HTTP protocol fCjouidc . page 27). And, as expressly stated by Gvulil. 

"With liood design even dynsimcany gpnemteJ pages cnn bcneCt from 
>S caching. By separating the dynamic content of a pi^e from the static content. 

the statk mn/eirt can be cached ana the dynamic ccntent retrieved and 
downloaded sepanthtiy. (Gonlde. nace 28; emphasis supplied) 

It is IbcTcfore Known to cache static data from Web «rcs usinjj a dedicated wnnpirtcr's RAM 
40 and hanl disk, so that ihij conjptUer can act ai a proxy bctvycen the WWW (Nvcb") servers 
and ibcir users. Several requests for a given Web page can then be served at the cost of a 
sraglc request lo (he oriiiinal Web server. lotoravt Service Providers OSPs) commonly 
provide such a caching service to their cvstMneis. However^ this technique sufl^ from tyvo 
niaio drawbacks: 

is • it is not applied globally, and even ««« of the Web sites who have acccs-i to a cache 

have to deliberately activate this fealure, of which ihey are oflen no! aware; and 

• it does not apply lo pages which change very often, as is (he cafic with dynamic 
conlent (oonlcnt which is generated m'the>fly 1^ the remote wrver). 



» There therefore exists a need for the caching of dynamic tiooient. as well as static conlent. 
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(ftpvcnry/Dt/inUlons 

Unle&t ot})cnv)5(; noted, or as may be evident from ihe context of iheir usat^c, any tcire:^ 
ftbhrcvMtiovui, acrottyna or ancntitic 8yrat>o)s ami nutations u»d heiciti are to be gjven tlwir 
5 oidioniy me&iiing in Ihe tcclvdcaj discipline to which Ihe tnvaitioa mnil neuriy pectwny. Tbe 
tbllomD); ti)ns»u7 of lenns Is intended to tend chtnly and consfslcncy to Ihe vnrious ilesoipUnns 
coDioiiKd heietn. as well ns in my prior nrf documents ivhicfa rtacf be ctled: 



10 Cacho Server 



Cookie 



Dynannjc Concent 



WTTP Server 



AtiigWy optimiaed.appHcatioo thai stores Gvquenily ncccsscd c*inrciit 
at strategic aggrq^y (ion points cJose to the vscn requesting that content 
in Older tn reduce tbe impsKl of tlclays and network bottlenecks. 

Cache AfToy Koiiting Protocol A pfoio«>I for synchronizing 
mvltiple cxbe servers. CARP maintains u shnreO naisespace that nwpS 
Bny Web object's address (UKT^) to only one nrtic in the array. 
Kequcsls; we routed to tlial node. 

The most eominon meoniog of "Cookie" on the Jnierntt refer.? to a 
piece of InforromioD scut by a Web Server to a Web Browser th«t the 
Bfotvscr soft^^urc \% expected lo save and to send back io tbe Server 
n-hcnever the browser nukea odditiooal requests Jcom (be Server. 
Depending on tbe t>pe of Cofrfue t»ed. and tin Di:iiW5qr*s settings ibo 
Browser mny oecepi or not accqit Hie Cooing and may save tbe 
Cookie for cither a short time or a loo{{ time. Cnolries mishl contain 
infonaattuD such as Jogin or regjstmtton infbmiatiog^ online "shopping 
carl** informat'icn, u.%r preferences, etc,. When a Server receives a 
ftv{}^ from B Bmwscr (hat uiclud^s a Cookie, the Server is Able tu ttse 
the infoonation stored in Ujc Cookie. For cxwnpJe, the Server might 
cujitomt7e what is sent back to the user, or keep a log of panicubr 
OKcr*.-* requests. Cookies arc usually set to expire after a predetcnnincd 
nraoonl of tioie and are usicilly saved in memofy until the Drowcr 
software is cloiwd down, at \vldcb time they inay be saved to tlisk if 
their *Vj{pirc time" has not been reached. 

'Live" content which is updated on a itBulur basis. Exftmples of 
dynamic content mi^il indnde a "cumiil tenqierahire* display oo a 
weather web site, search restUts, or o "CiirFcnt Top Headlines' item on 

n news web site. 

A server that implements the HTTP protocol, enabling it to serve Web 
pages to cilenl agents (browsers). IHTP Servers support intcrfnces so 
that Web pages c^n caII e?(tcrnnl jvuiframs. Tbcy ftlw s^^port 
cnciypfion mechanisms for securely exchanging intbmiatton und 
aulbenticRtion and access control mecharasms to control acce!« to 
content 

Internet Giche Pnc^tocol. A prolowtl forsynchroni7.mg multiple cache 
seivcn. Fiicb time a cache server experiences a miss, ii broadcists 
messages in all peer nodes •iskii>g whether cnch has the content. The' 
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rcQttcsting server then must issue d letiuesl fyx tbe content ond ibramd 
it 00 10 Ote Dser. 

Proxy Server A piuocy server acis ta on inleimcdiary between • usn the Intemel 

so on enterprise can cnmre seouriiy luid ndministrative contml ond niso 
provide a cachiog scivfce. A proxy server Is usually nssociaied with w 
part of 3 e*iovay icrvcr Uial scpurulcs Ihc enlerprisc network frooi the 
oulrndc nctworJt or a fhcwtiii that pio>ccts Ibc eijJciprisc network from 
out^ile intmsion. 

Rootere Routers are Ibc dcvjc« thai buiJd a fully mlovoniiccfcd oehvoA out of 

0 coUeclion of poiol-liv point links. Rouim un the Inlanc^ exchsnge 
bfannntion peilatiiiog io their locfti section of the nct>vork« pnittcululy 
how close they arc topologicfiliy (o local s>-stcim. They coUccti'vcly 
butJd B mop of bow lo get firojn any point in (he brtenxt to any other. 
Packets ere routed based oo tbe exchangeii mapping mtonnotion. until 
the last router connects JlTccUy lo the target syslcnj. 

Static Content Tixcd" w lung-lenn nnchaiigiug componenl« of vveb pagta srtored am . 

lUes Ihiii an either never cbangcd or arc chsngerf only on an fnfie<)tieDl 
bs$is. 

Swifcbes - Hi gh'Spcednctv^'orlc(kvicc8(hi]il)'picoIIy sit on the periphery of the 

Inteiijet, Switches difltrfixun roulers in pryvidirg hijihw perfwuance 
al » lower price but with limited functionality. Typical STvitc)>c3 cnn 
nmte traftjc locally but aren't concerned wih complexities of routing 
found in the high-speed Imcmct backbone. Switches play ra inipiirtetnl 
roje io caching because ilnsy are often used lo dtvcrl Uie cavhctbic 
traffie to Ow cocbin^; system. 

Ilyperftxt Markup iJinguflgc. A spcdiication tuised on Standtird 
Generoiized Mailiup L^nstwge (SGML) Jot tagging text so ihut it ntiiy 
be displnycd in n user agpjit (browser) in a stanibid way. 



H-fWL 
JP 

40 

IP Address 



Uypcfiext Transwission Pfotocol. An appIic<)tton-levet protocol th^t 
nins on.ioii of TCP/IP, which is the foundation for the Wodd Wide 
Web. 

Internet Protocol. Tbe oetworfc layer for tbe TCP/IP pmtocol suite li is 
a oonneciionless, best-efTort, pacliet^iwltchiog pnitocoL 

A 31-bit address defined by the Inlenict Pfotocol tbat is usually 
represented in decimal nulotira. Il uraquely identifier cacb computer 
on tbe Internet. 



Protocol 



An agrtx-d-upon set of technical rules by which ooinputcre exch-mge 
JuforjnalioiL 



UK J, 



UnifhnD Resource Locator. Tlic tneUiod by whidi Interoct sites are 
addressed. Il includes an access protucol and cithtT an TP address or 
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DNS oairre. An cxnmpJc is hUp://wvw».ilonwin.cwin. 

Short for Usti*s Network. A collection of lew of ihuusonds of biillrtiD 
boards rcsidhrs on the InteuKl. Bach contains diKoisxion efWtpn called 
ncwsjiroups dedicated to sitskm lopivT:. Messages arc posted ami 
rcfponUctl to over the Navwirk NexvvJ Tninsl'er Protocol (NNTP). 



Web Saver S«c HTTP Scrvtr. 
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BRIEF DESCKrFriON Ov mU INVENTIQN 

An object of the iuvtnlioa io piwvide a tcchoiqiic foi mtociijg bnntJwidth usnge of WWW 
servers and intproviiig Ibc QoS of WWW uies. 

$ 

AcciTtling to the mvcnHon, a tcchniqiK Tor caching ob)oet5 havins dynamic content on tho 
IiUcinct gowntHy comprises dbposing a cache in the Internet for storing raid updating copies 
of dynamic conlenU The t;»che may he diqjoseU ol a locution selected ficom the group 
consisting of major fntemet switching locations, dial-tn (iggregntion points^ And corpomie 
10 gateways' The cache nuiyolso store and updnte copies of slatio content. 

Acw>idii>g to ft feature of the invention, iiptUc diaractcristicsf of the objects arc detcrmiucd, 
ant a lime to live flTL) patarocjci for the objects is adjusted based upon Ihc update 
vharactcrisctcs. Oenerslly, the object is updated if its TFL is less Ihon its njie. The TTT, for 
IS- an objcci may be AcUustcd to: 

(t) maintain its pmhabili^ ofcnor below a predclennincd cnm pntbablKty threshold; 

(ii) mainuin its error rate below a predelcmuDed enor probahiitty threshold; or 

(iii) matiMain ib delay time below » predetermined delay Ihrosbold. 

20 According to the invention, a mclbod of responding to a user request for an objnct having 
dynamic conteni, said object ori^onaiing Troni a acnvi, comptists storiog a copy of Use object 
in a cache; estsWishing a time to live (TTL) for ibc objtcl; receiving the aw request at the 
cache; 

ftilftlling flw user request wih the stored copy of tlx objcci if io TTL is greater than ii« age; 
2S and fetching on upJatnl copy of the oiiject and rvspunding to the user request with the 
updated copy if (be TTL uf (Iv stored copy is less thim its age. 

According to a featwo of the invantioo* the TTL for the object Is fint set to q nBsonablc 
lower limit (Tnm) tmd is Ibeo ndjusied based on the fiequency at vrhtch the object actnallx 

30 citanges. 

According to a feature of the invention, each lime the cache fetches lite object from the 
scn*er, the cuchc perl'onna Iho rollowtng procednms: 

a. if another fetch for the some object is ongoit^ waiting for the previous fetch to 
35 cumptclc; 

h. fetching the object fjros) the «ervec 

c. replacinc the cached copy, if present, by the fetched otgect, albr having compared 
thero lu delemnne whetlier (he object had changed stn^ U ivas last feldied; 

d. initialrzing or updating the object's change statistics accordingly; 

40 e. marlcing tlv: obj act as static or dynamic cuolcot depending on the server's reply: and 

f. if the object Li d>Ttaraic. setting its TTL (7) to an appropriate value wtb lespect to 
an overage time betv^cen changes (t) determined liom (he object's change statistics, the 
number of user requesis per time unil (r^ detennined fiom Ihe objects access statistica, and 
one of the following pwxfcdures (A-E); 

4$ A. raaxinrum exror probability; 

B. maximum error rate; 

C. mnximum delay; 

p. any wmbination of the above (A.B,C). taking the lot»"cst 7"; or 
B. any combiatlioii of the abo^'c (A.B^C), but keeping T within a predetcimincd 
jO window ofrcaftonable" values bounded by Tm'm and Tmaat. 
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FrefenUy, (he cncbe) are detl)cxitc»l machines ood file pUieed so ibai Web brow3iAZ passes 
through ihe ache instead of going b)I the way to Uic ori^ml ales, in 
locslions, )dnilly wilbin Oir nenvork of ISPs inovidins (he Internet cunneclivity to ihe bigficst 
S niuttber ol'uwrs in those locations. In this numocn 

- Ihc oscn of those JSPs uod. lo n katsur cxtait, neighboring ISP*, ^v^n cijjuy 3 hu^C 
QoS and speed improvement, for nn>st of the Irt/Tic wil} slay \>ilhin or cIo«: to the ISIV 
intenia} oenvoiVs uod jiut need to go ihrough Ihc liis)]ty-1undod Inlemct buukbons; and 

• Ibe origtoal vveb*sties wil/ no longer need as miicb bandvwUh, noce ihc caches wtU 
10 ahsort) most of the load. 

$tiKe many wcb-sEtcs tend lo serve aU-dywiniie content, dysamic content wjJI be cached, ns 
well ax istatic conlenL However, H is not jneiely a oialler of the caches reraenibcriitg Ibe 
content, as tlieio is op iniGcution as to when the laltei is g«Jig to change. >1a)ce» it nced!^ to 
15 be rcioaded,peribdiGid]y,jnsteiidur8inqdy>xrving the cached copy. ' 

TbereAre, the cliche reloads a page (felcbcs an object from tlie server), whenever its 
conespondtng cachrd copy has »K>t been rcfrcslied for a givtn time (lime lo live, "TfL')- 
TTL can be thought of 9S (he "shelf life" of the poge. TTie cache Cist, sets ihc TTL for a 

20 dynamje object to n icasonohle lower limit Ti^. Tbeo. over time, as tb« cadie retowls Ihe 
page aeverel times, it can track the frequeocy ut which (be pag^ actsally changes conteni. and 
adjust the Tlx for (hat page accordin^y. This TTI/ technique mtniics a ooinjDOO caching 
nietbod for static pages, if the ori^nal semt of (he page xpeaftcs a TTU But since serras 
for dynamic pages do not ^ci^ a TTL for than, the cache has to establi^ a leasumble 

2S TTL of its own occonl. 

Therefoj*: 

- whole wcb-sitcs can be cached, iochidzng their dymtmic content ; 

• tiafllc be(\veen caches and the origiool sites is better maxuigej; 

JO - wliilc wine users ought get ati outdated cojitcnt. in the case ncf a p»ge chimgtng 

before (he cache reloads it, it is possible lo limit the delay bcrBveen the two (which cixdd 
huppen anyvnty vAUtout the cache, when the network gets sahuated) ns well as ihe error 
proba(>)Uiy; 

• the system can adapt, in teal &De, accotrdiog lo the nnmber of requests to each page 
}s and the actual update fivquency of the page. 

Other objects, fcutures end advanttges of the invention will become apparent in iighi of ihe 
following description thereof. 
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Bnvy DESCR IPTIO N OF THE DRAWTNOS 

Reference vyill be tniK^e in ddm] to pfcrexreti oubodiTiitDts of ibe mventkm. exnmplBs of whicli 
may be illustpitcd in Ujc itLxompmyuig (hruwhiii fignnRi. The Ggtires iir? inlendcd to be 
ilhistnrtivc nol linilii^ AJihw^i ibc invention it fgenaraily desert m (be wiKeKl these 
i prdferml onboJinKnliH it sbmikl he inutcrstood Ihst ti is ncn imciMlccI In limii Ihc s^jiH nod 
soupc of Ihc lovcniinn lo these pnriictiJai embodiments 

In flowcbafls pnnKOIrd hcrem, reclangMlar boxes genmilly rrprcscni a scqi)raliat step being 
performed., a (ien&mlly rccliuii^tJar bo^ with poiiiletl ciwli: represents a (iecision Step (test) 
10 having two mHtually-rxchisi vc results {"Y"^Yi;<r, "NM^Jo), »rJ an empty cmle is not n step or 
a test, but is mciely a grapUical junclioo point ftt which two ftr more paths hi Umt Howchint 

The sinicture, opcretion, and advantages of the pteseni prelbred embodiment of the iovntfion 
15 will become fiotber apparent upon considcfatran uf (fa« fuUowinB demiption, takei) in 
conjunction wih Ibe accompanying fii^ures; whtTciu; 

ffginr 1 is a greaUy simplified schematic iI]^t5tntion of (ivi bicemet* iJhistr9tin|( oo 
embodiment of (he caching :^teni of (htt present inveniioo; 

:o 

Fi^nrc 2 ii a Howcbart iihxsirating how user nnpeats for static and/or dymmic cootait are 
handled by the cache, according to (he in^'Ctttion; 

Kgore 3 is a graph illnstf atutg an aveiage error prohahiliQr* aoccnfing to an analyas of the 
25 inventiwi; and 

Jlgnra 4 is a giaph iUusintting the evolution otf enor proboUJity, aooordfng lo ao djaly^9 of 
Ibe invcttlkn. 



(41) 



JP 2004-504681 A 2004.2.12 



Figure 1 is n great)y simplified (Khemalic illustration of the Internet cnvironinaiU iltustrating 
an embodimenl of the cuchlng sysletn oT ihc present invenlioa. GeiKnIly, a user 102 /oakcs 
5 H "Vcqwsl" for (wi "object" {c.^., ii Web page) whidi \» moJe uvailBble on ihc Intemet by q 
strvwf (e^ ISP) J 04, (Tlie object typicHlly origimtes at a cnnlent providei, tiot sho^D.) A 
switcb 106 inlcf^Rloes a cache (cvhe sorvtr) lOS to (he Internet. Hie cache raiy contab a 
cojiy of the Web paga There m two |C5$tblc 'responses' to the uw rcqiiejl « either the 
server "tervcs" (or 'service^ ibe rcquwt, or it is 'ftjJfillcd" ir> the ciche. In the bttcr case, 
n the contcnl must first hare bwn "transferred" to Ibo cache, which moy periodically 'fetch" (or 
"reload") updated Web page conlcDl from Ibc scavcr. These icmis will be adhered io, to the 
extent possible,. b the discussion thai WJows. 

As discossod in Iho Goiiltte article (e.g.» page l$» Ilhistrattoa 4), the svvilch 106 mny be a 
IS. higjli-pcrformanco processor that can Iim^ ot ncMvojrk tzoflie ood ma ke rou ting Oedshnu bated 
on piolocoU above the IP level. M a rc$ull, the switch can direct HTTP (and other) trafik to 
cadics (108), wid send tlic res* of the traffic directly to the Internet. "Unn is exnophny of 
one of a number of paviiblc system architectures. Which itrchitcctorc is used depends cu 
M%-craI factors, inciudios vrhcrc the cache is implemented, tlic primafy purpose of the cache, 
79 and the owturc of the hTsfiic 

As fttrther di.scussed in the QyOde tnticle: 

"Oichcs can dthet be deployed io a transpaieot or aontransparere fomt A 
25 nontranspnreDt cache is explicitly visible, ;mcl browsers w other caches thai 

use the ciiclie ate overfly canftji^ircd to direct tratfie to llic cache. In this case, 
Ihc cache act.i as a proxy sigcnl for the browser. fitlfiUing requests when 
possible and forwarding requests to the origin server when necessary. 
NontriQisparoM caches are often a component of a larger pxwty server acting 
30 as pail of a gateway or firewall and adcVessmg many difTcrcnl appficatioos.' 

(GCTiIdc, pages 1 6- 1 7) 

A iransparcnt cxba sits la the oe^York Oow and furKtiocs invisibly to a 
browser. Fch ISP iiod enterprise btKlLbone operatkms, u Innsparenl 
33 configoratjon is prefared because it minim'izes the total adininistiative and 

support burden of supporting luers in conllguiing their browsers to find the 
cache, ((ifiuljls, page 17) 

Caches should be fanpJemmted innsptneatly to maximtze the benelils of 
40 cadiing. A ixtnlranapervni implemcntntiofl xoqutres having brovvaers nuniualfy 

configured to direct their requests for content to the caebe server. In a 
linnspiwni confi^uratjon, cache benefits ore delivered to clients vnthout 
havijjg tu reconlijiure the browser. Users aulomnltcuUy gain the bemrfitji of 
caching. (Goulde . pnge 17) 

45 

Id an enterprise eiiviioomeni, iranspa/eiKy can be implcnicated cither through 
autojnatic browser coiifiguration tx by intercepting traflic on the nctwoii- 
Both Netscape and Microsoft provide ntiJities for centrally configuring Luge 
networks of browsers and for customizing browser ccmilgiirations beang 
30 instaJkd on users' PCs. Browser plt^tns can also jnx»'idc automatic 
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umnuMrsliun. Alihougb Ihis approach is tmnspwent to Uxi user, it (1oc« 
ra|iii» ixhnin»irtilive efluTl on an ongoing ham. fCtotilde. pnge 17} 

There arc several ufXioin for proviiUog Ifan5i))arent caching 

• The otichc can be ciinngincri a.<t if tl >vcrc a rtnilcr so Ihst all blemet-bosed 
tmlliu is niincd M it. This is a tTare;porrt)i conljg^rstitin \h9\ nrquires no 
connguraiion of the hrov^'scr; the hrcnvscr ur do^vnstraiim cache is unaware uf 
the cache's exislence tiDl still bmeGts Uoao it TIic doNvnside is that the syslcia 
on wbicli tlw ctchn rcadcs hus to dcvole some of its resutirces to rouri;^ ariO 
Jbe cachif becomes a niis5iu<]>criiica} pRrt oX the networt:. SvphJsticotcd lOiiicr 
configuration wtbpoiicy-bwd routing cao niininiite sooit of these issues hy 
only directing HTTP (TCP Fori 80) braffio to the cache, bypa.«ing Uw cache m 
the event of faihve and scoding traffio directly to the lotcnicl. fGo<iltle . pngp 
17) 

- An tncnasingly popular option is (o nsc a Laytt 4 switch to iittcifuce the 
caclie to the Inttsnel (sec Hlisstnition 4). 'Iliese switches^ oirrcntt}- niTe/ed by 
Alteon, rownchy, AnowPoint. unci others, wre hijlr-pcribrmflncc fvoccwors 
Ibftl can look ai ixivroA IralBc and inake routing tkciidoDs b'JSed on protocol 
above tlte IP level As a iwult, (he switch can diretl HTTP (<n)d other) tralllc 
to the cache* and scAd the rest of the (rafRc rfirtcily to the hitemet ... f I'jhe 
switch can parse the HTTP retpiert and send the request to a specific ixxfe in a 
eocjie (am based on the URL requested. Usiog an tntelligcni switch keeps 
ittUKcesswy network traffic off Ibc cache, simplifies deagning for availability, 
and distributes loading on the cache farm based oo specific URLs. (Goulde, 
page 18) (An architecture similar to this one is dcuuibed hepeinahove with 
respect to Vlgarcl} 

- Anolber option for transparency is the Web Cxhc ConlroJ Ftotoool 
IWCCP). WCCP was dcvddped by Cisco Sysleow to aUow Web caches to be 
tran^rently installed b netwoiis using CL<5Co IQS- based routei?. Wilh 
\VCCP» mp traffic Is redirected to tho Wcl> cache Instead of the origin 
server. WCCF does not require any changes to the network ardhilBcture> 
thereby simplifying the iisplementutiuo vf IranspQiient Web caching. fOoulde. 
page 18) 

- Web Proxy Autodiscovciy PiDtocol (WPAD) is o new pn>pn!ied standtrnJ 
protocol, whio}^ when integrated wilh browsers, strearaujg mcdn clicals, and 
Other Internet client software, is dcsisned lo automatically locate caches and 
seivices on the net>voifc withwl requiring any configuration by end users. 
WPAD provides a ilexible, vendor'ncuttal software altcmtitivc to existing 
cache ti3a<g>arcncy sohittons tliat utilize routing or switching equipment lo 
Ibc fpiurc, WPAD-enablcd client software will BUtomaiically connect OJtcrs 
with embedded network scii/iccs in their regiun, providing simplicity for both 
useis and the network providers that deploy these sen'ices, fGoulde . pages 

Cuching syslcins can be u.ied to optimize the pcrfoinisiicc oFa Web ssarvtr site 
as we)) OS lo speed Internet nccan for Web browser uscn. In a reverse pToxy 
cnnftgurnlion. the caching system sib in front of one or more Web servos, 
intercepting iniflk tu those savers and standing in, or pnixying. for one w 
more of the servers. Cache xcrvm can he deployed ihrougbuiit n network, 
creating a distributed network for hoxted content. When necessary the proxy 
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cmrhe scrwr win request djiifitoic and oJ}»sr dtort-livcd content from Qvs 
urisin scrvat. This enables cca)teni fi\>m Ihc Kite to bo served from a local 
cache instead of from Uic origin server. The proxy saver con be opttmizeU for 
tu^ peffonnoneev elTicient opcnUion, omservins resuurces. ood ofMuiiUin^ 
S the origin server frojn serving static content. Revejw proxy ractnng praviifcs 

benelii£ to the ecccss pro>dder as weU as lo the user. Those bcnufit* iuchidc 
the abjlil>' to enable load bnlajicinjfc provide peaK-dcmand instiranceto assure 
svajlsbility, utd provide dynninii: n>irrQriiig of content for high aviulnbiJily. 
fCiooldc page 20) 

10 

There are three gencrfil cKvactcristicb' that describe where caches are best 
localcdooaDetworii: 

- Clhoke Points in the Network TralTic There we locaiiorw where a large 
miyority of network traffic posses and is therefore visible lo Uie cache server. 

15 fha allows the cncho to handle n>ore Fe(]ucb(s and store more content llian if 

located SDitKwhcrc that can be eiisily bypassed. 

• Points with High-HetworV Load. High fmffic conditions allow hiehcr 
cache uiilizMiiun mvl tberel'oru greater benefits can be achieved. 

- LocMtions that Prudncc Greatest Econooyc Bisnefirs for a Cicbc. Points 
20 where tisei^ will beoeiil from high cache hit mtes while nko jerhicing 

vpslreani bandwidth rcquitctnents will provide both QoS beneOfs ;^nd 
economics for the access provider. l<ocalions with these cliartctcristics are 
typically foimd at major IntcnMd avkitchins locations, diid-in asB^Sation 
points, or corpomte gnteways. (OottlJc page 20) 

2S 

Locations with these characlerislics are typically found at m^OT Internet 
svntching location.^, dial-in aegregplion points. Of cofporate «ateways» 
iDOlvidtog: 

POP »Dd DlAI^irP (sec GoulJ e. page 2J, Ulostralkw 7) 
30 NAPS (see g ou Mc. page 22, IllustrHtion 8) 

Satellite (sec Ootildc. page 22) 

Intcmatlono) Gatovny (wie Qtwldc page 23, Illustration 9) 
Web bo$tin<( (see Goul(le> page 24, Illuslrittion 10) 
Last Mile (see Ooulde. page 25. llhislmtion 1 1) 
35 Cof poflite Gateways (see Cwil de. po^ 2S, Qtustration 1 2) 

A pci»ni turring ordinary skill in the art will readily tmderstand where to locale the cacl)e(s) 
of the present in^-entioa in light of the description set forth hereinabove. Generally speakinp. 
the cache oftbe present invmtion con »je located anywhere that there is (or could be) a cache 
40 serving statio content, or it can bo incorporaicd into an existing cache which fullllla rctpicsts 
for »tot|c content, with the additional foncdonsility enabled according to the techniques set 
r*>rlh bcknv. Or, it con be pivvided as a scpajpte, dedicated machine (computer). 

Figure 2 is a flowchart illustrating how user requests for static and/or dynamic content arc 
■tS haniHed by ibc cache. 

In a first step 202, for a user request for an object, the cache dctcmiincs whether the 
icqoeMed olQeci is in cache. If not (NX the user tequest is passed on to the saver for 
servicing tlie request ami mcfuiwbife» In a step 2IM, Ums cache fetches tbc object from ilie 
S9 server in antklpalim of the next request for Oie object froni Ihe same or another wser. 
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OcnexaI!y. lr» \hxs exampk^ h1) ret)(icst$ for obj«ts ore presurncd to go through a cftche server, 
which is trjusparenl to the \Kiex. II inteiccpts information requests and deckles whether it will 
provlJe n rt:^n5c from a cached local copy w from o remote infonnaiioo source. After 

S fctchio^ infomwtion from a iocal »urcc, ibe coc)io servrr iitxides w)icihei to store it locAlly, 
and if KJk for bow long. A request for infomiaticn which can be provided from a locaj copy 
is luiown as a 'vacbe hif, Convrtscly, a rct^oest for inforuoation which is not stored }<Knl]y is 
known o "cndic miss*. When the slomKC* determination afj^orithm is wclMeugued the 
probabilily of a cache hit b grcnily improved, and oppareni response timt to user requests 

10 (felahnl lo QoS) in reduoed. Taiiher, every IfifbrntMioa request satisfied by {ocaUy cocbcd 
coRleQl (cache bh) reduces irafTtc on the (sctemal network, pcrnuUinu sboxter ivsponw; limes 
over the external network. 

]f the requested object is in the cache, it is dct! UelennircJ id a step 206 whether the 
)$ requested object is marked as static. Kso (Y), it is Ibcn ilclcrmitKi! m a step 208 vvhdbex to 
updsie ihc okikU copy or lo ose tl lo hlVili the iiscr reqiicst. imnj any suitable standard 
Blgorithm for caching sl&iic objects, siich as coxopsring the objects '»gp' (the (iine cispescd 
since il has lasi beisn refreshed) to the TTL (if the original server of the page specifies a 
TIL), asking Ihs server tbe latest luodificallon tiin^ ete). 

79 

If the requested object is in cacbe, and it is dynamic (N, step 206X it is cletemuned io a step 
210 whether the cached cgpy^ TTL ("slielf life") is less than its age. 

If lbs cached cop/s TTl^ is Jess than (a lower number than) its age (Y, j4ep 210% it is 
35 considered to be "staJe", and in n step 2 1 2 the cache: 

update:) tbe object's access statistics Countbcr of user requests per time 
period - hLst few minate.s la<:t hour, etc.); and 

— fcld)cs (he object fmiij the original server, 

30 If the cached copy's TTL is cqonl to or (^eater Ihon its age (N, step 2 1 0), it is considered to be 
"fifcsh", and in a step 21 4 the cache: 

-< fuijnits ihe reqticisi using the cacltcd copy; unJ 

- updates Ihc flhject's access statistics. 

3S Optional ty, if the time difference between the cached copy's age and its TTL is less thaft a 
given time, and the oDmber of lecenl vsa requests is mnre than a given rate, it is euosidered 
to be "u^pcd" wid "popolar". and Ibe cache fetches the obje<n from Ihe server in wfaul is lenned 
an 'aollcipated refresh". 

40 Addit1onalty> for each time the cache frtcbes vd ubject from the server (see steps 704 and 
212), the following procedures are perfonxied by the cache, in a step 216: 

a, it another fetch for ibf snme object 15 ongoing (e.g.. dtjc to a jxcviuus user request). 
Ihe «»che wans for the previous fetch to complete, rather than diiplicaiins it request; 
• b. fetches the object iTom tbe server, 
43 ' c. replaces its cached copy, if present, by the op- to-daie object, after having coitqxircd 
ihem to delennine M'helbf r the object had clwiiged since it was Issi fetched; 

d. initialize or opd3tc the object's , change statistics (mimbcr of changes per tifD« 
period) accordingly; 

e. mark the object as static or dynamic coitteol depemliji}; on the original server's 
50 rep^ (prescTKc of 0 modiJjCiition lime, or of a ocm^ero TTL, etc.); • 
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f. if the object is dynamic set its TTL (7) to the af^m^naia rjlne with respect to (be 
rtventge time berwcen dutniies t (deiemiincd fiom M» objects ttangp statlstksX Ibe jntmber 
Df user requests per time unit i\ (clelermincd fron the objecb uceess statistics), and a scKxieJ 
tmu nf the following pjocediutis (A-E): 

maxiniiuii emir probobilily ( p$ > which is Oio avenge ndVo of the munber of 
requests AilHIIed uring a cadicd copy whosv cwTes»)»M(Ii»g orij^oKl ot^ct has changed for 
more than a given tiise tV, over the totatj nomber of requ^s: 



£. mflximum eirur rote ( /ig ). which ts the average number per time unit of request); 
fulfiJIed using a cached copy whose corresponding uriginii) oligect has changed tor mote ttum 
a given time fV: 





f 


2 


u. 


w 





















LI, niaximuin delay { i>» ), wlxich is the overage time between «n object change and 
whoi the cached et^y is rcfreshtd: 



any conibinalira ofOie above (AJ)iQ. >aU^ tbc lowest 7^ or 

E, any combination of ihc above (A,B.C). but keeping T within a prcdetennincd 
window ofi^wnal^te" vajnes iMunded by 7'nitn .ind Jhaax. 

In the above equations, the following porametcrs have been used: 
7 is Titoe To 1 J ve for the ilynam ic object ; 

W is a given lioic since the otiginal object Ins changed (i.e., bow long i1 is outdated); 
' T is on avejage tbne between cbwges. v/tdch is detennined fton the object's diangc 

siotisdcs; 

n is number of user reqyests per liinc unh (e.g.. frequency); 

po if; aiaximnm error probability, which is the average ratio of tbc number of requests 
ftttfilled using a cached copy whose cfHrrcsponding Ofi^nal object lias changed for 
nioic than the 0vc» time W, over ihe tntal number of requests; 
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i\t is mninniin error mt, which i» the oven&e number per dme vnlt of request! fUfllKad 
usii^t a cadted copy whoM'correcpoiidins original otgect bus changed for mora tUm 
Uic given ihneH^, 

P« is moxirotnn delziy. which is the aveniKa iimt between ao ubjed cJiimye Mid when ihe 
5 cached copy i s rcin;:J led. 

SefrdJng and Adjusting TTI* 
/V simple n>ode), which lakes oo Mber pomiseler than the .ivenge ihne faetwecn chanecs (t) 
of the object is die expooeotla) decresse model - Uwl is» (be piotabUltv dun me object does 
1 D not cbiinee during a lime period of laifclh f w considered lo be of the fwtn c Tt i9> of 
cotireo, I for f «=* 0, unil taxis lo 0 as / tends to + (co) 

Considenn^ a situalioa mtb inaay oser reqtiests, as compared to the lefhesti rate of the cached 
cup/, so (he cache will iipdate at regulitr k^al^ dcsjaiie bdns uscr^ivcn. 

IS 

/V ven«^e &eoucncv of chwgcs 

The moiteFs consisteiicy is now checked especially whclbcr the average update frequenqf of 
a ]^vco objed H l> x. Lcl p', (/) be ihc fsubobility that k changes over a time poiod of /■ 
By construction: 

20 

And. for A > 0, = ipl,mi-c-^">'"-^'" - { 



(thir piobaUIity Ihat k chan{^ occui over |0:l] is the suro over a of Ibe 
23 probability lhc»t * - J changes occurred between 0 and n, that Ibcrt waa a 

ciianf^e between u and ir^ilii, ant! oone between rt+du and r)- 



Onc can deduce by recurrence that: ^ ., 




•15 
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Now the orveiasB number of changes over o ihne period of / is (by dcfinitinn of oven^tg): 



s Hence» the chnngcs do have an svcrege fnqaency of V X. 
Average c/ror probability 

The error p-obability is defined aa beiog the percentnge of cases where the user receives the 
cjjche*! copy whereas the objct-t hsis chunged on ibe oriRina) server, ;a)J ihc aicbcd copy is 

10 wiiitatcd by more tlwn tt given lime W. Figure 3 ilhistaites the pereeoJagc mor (vertkaj 
Mis, in pccccalBgcs) versus tinjc (horizontaJ axis, in seconds) for instance, with W-1 5s, t - 
6Qs, and r - 305. llK ^pfa illustrates tho piobobility that the cocbed copy is "stale' by oKf e 
ihon seconds, b) the graph, ;f is ) 5 seconds, £0 the probability is zero betweeo ] ondl5 
sceoods. TKeq, the probubilily rises until 30 seconds, ^vhcn u re-fetch occurs (7- 30s), OS 

\i illustrated btv the .sawtnoib paUem 302. Tbb pattern lepeaits itself every 30 seconds. 
Between 30 and 45 seconds, the probability is ttsoin zero, and a re-fttch occurs at 45 seconds, 
and the probabiUty rises until 60 seconds, as iitustrated by the sanlooth pattern 304. A 
sioiilar result Is sbowo by the sawtooth patlsm 306 between 75 and 90 sccoikIsl 

20 Given that tbo cache is supposed to update the copy of the object at rcgolar 'mtcrvab, the 
calcutstieiis can be pcrfbrmcd orycr a time interval [6;7J. 

The average time € within Ihe iniexvnl during which the conleni is upldated L, ihe sum over 
aJJ intervals fir, trfd/i] of the probability that there vfon no change hehvccn 0 and <t, bol that 
25 there Mras a change between h and « h!<i» njolliplicd by the length of time during which ihc 
content lemeins outdated 1^ more than W. Thai \r: 



K.* 1.1 




t 
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From ihu, the erm probability can be dediiced as: 
to yen- ^£ = l(cr-l + 0 



Ea^er-lo-hMDillc bininiis can be ciilciitoletl hjf ]enrng: p/^^ _ SL + jj ^ 1 4 



Its tltirivatives are: F (o) ts -~ — a 4 I - * ** 



F"'(a)=l-e- 

Since F(0) = F(0) = FXO) - 0, ami FXo) i.s poailjvc for any U >= 9, FXa), F'W a"^. 
ultimRtely, I'(a) aro as we)l. So: 
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There arc better, ensy-to-flnd botmds for tbe case of ii>»l . biJl ihat w noi im imeresltDg case 
(T-W5«T). Hence. 
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Maximal TTL to wmt a riven tmyr DroWWrrtr 

An upper bound should be set cm TTL which ftiU ensures that the oior prc^bility ts below a 
given ihitshoJcl pi. 



r 2 '^^ 



i 2p::r 
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Therefore, an sccepmble TTL is: 



MaarimalTTX H 

10 lUs 15 tbe siijie as for (he enrr pobobiliiy, icphcingp^ /rs / A as: 







1 


rf 


w 
















\ n i 





?S ThaefoiB, no siccepltiblc TTL is 



n } 



3> Averaw d elay for a uiven TTL 

It is useful to consider the average delay D between the cnchcd copy wmJ the original objcci 
in the case ^-ticrc the latter has changed during n {pvoi interval. FLnil, coWtilatft <A(t//)> 
which is the nv«roge dcJay knowing that tbe chorigr oocnmd at a lime « between 0 and T. 



2T 
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From this, the overall delay <Ao con he ileduced by {ntegnttlnj <St{itp times the pnil>ahtlity 
tbol it dinngp occiifred beiween n nnd h -» d v. wer {Oi,7}: 



^ r IT 



■ii^""t7)*'"'-'r"'^'f|*^'*) 



25 Since the <A> calculation does not (alee jnio account U»l tbere lias been a change within 
(0;7]. it ritould be divided hy (he oonespoodiog {mobility to get D: 



I 

l-e 
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andsince: £L^£- + a - !+c-iO 
$ 2 



5 



thcti: 



^ 1 t' I (T\' 



15 , 

In order to limit i) to a TfL sfeouW b« chosen, sach as: 

» 2' 

7* 7* 
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Fnmi (he nbv^t^ the followin)} lubles of Tit (r(»» versus a aiunUr uf cnw prvtobililies, 
fw varioan values of t oiid FFcsn be derived; 
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From IhB ftbo\«. the followiog conclwions can eeneraUy be made. 

10 llje probabilify of the retrieved infonniitictD being "stale' (older than W) is wsentiftUy vxro 
foi values of T(s) Cfimc (q li^x) less Uimd ot cqoaJ to W and increases with incrci«it« TO 
according to b dpcnyiijg exponat(i:i), approaching 1 00% t^bability of enw at infinity. This 
obscTVfttion that tlw emu pmbahility is rem for values of T(s) less than oj eqtiR) to W ii 
e^svutialty « "trivial" rewjt. since i( is clear tbiil no information can be older thtui W if it is 

IS updated D>ore frcijuenlly tb&n W. Cienriy, however, the average update inieivaJ t has ii 
significant eireci on bow steeply the euor prohahility climbs for vahies .of T(s) grcutcr tlun 
W. The uFMler the average apdaie interval t with respect to \Y. (he le«s shuply the ernw 
probobiSty rises with irKrcasing valoa of T(a). 

30 Figure 4 is a graph '100 illustrating tlw evolution of enw probability ptn (veilfcal axis, 
cxpre£sed as a perccniage (yO), as «i fimciioo of the TTL (in seconds (s)) Ibi vniious values of 
X {W^iSx). The line 402 is for T » 20$, the line 404 is frr x = ^0», the line 406 is for t » 
30DS, and the line 408 is for t - (OOi. 

25 A ^Qphical illustration of i\ (cchoiquB for cbiK).<;rni( a Tl'L to maintain error probability 
belvw a tbiwhold value jijj^ts oWaiiKd by idcntiJying the V4lue ol^on l)\c vertical axis oXtbc 
gniph 40O BJid following an iinogipory line horizonlally dcioss the giaph 40O lo where it 
intcr.sccts the cibtc (402,404,406, or 408) for tbe appropriate value of t. Hy vny of example, 
if an error piobabilily threshold value of 10% is chwcn» then the TTL for t » 20s (line 402) 

30 is a Utile umter 30 seconds* for x - 6Qs (line 404) it u sboul 35 seconds* for T » 300s (lire 
406) it is about tS seconds, and for t » fiOOs (Kne 40S) it is about ISO seconds. 
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By <tepk>yinc caches for rtaUc ond dynwuk croienl nl nppjopriate localioos in ihe Inrtract, 
and by selecting apprupriute upJa(t: chHrscieristics for cacbeil dyiiaoiH; cunlmt as described 
heKiiuhove^ eHective user respowte times to both dynamic and mtic content can be reduced 
while simolianeously reducing congestion on the ejilenja] nehvork li e., the Internet). 

3 

The first Itoc on ead wcr (110) rtcdves a web pngtt, the web page may optionally plant a 
GuoUc io Ibc tise(*$ brtnvscr. 'I'hcrcaAcf, every tsmt the cod iiscr .accesses ihc web page, Ihe 
brow^T sends the cookie olang wiUi lite access re<ji«*l' ^ cookie itHh Ihc jjervo (104) 
what Ihe server ^«nls (or needs) to know About the end user, in oddjtion to siniply the ihcl 
10 that the «scr xma to retrieve the web page. For cxampje, iJT ibe web page k a perfioftaltzcd 
\vcb page of n sinitle end user, the scm'er krvnvs the end vsex's preferences. 

According lo a fwmirc of Ihc invcntioo, (he cucbc (108) am look ut (he cookie and see if ihc 
requested web page is a personalized p«t^ of a sui^e end user, oi if the cookie indicates the 
15 Dccess request is coming from u (global end user. If (he rctinested \vrb page is a personalized 
w«b page for a single eod user (or if there is some other iudicatiod that the web p»ge is 
supposed to be fnodsfied cacIi time tbat it is ocoesscdX tbcji the web page should not be 
' uiched. This cai) be occompHsJied by setting ITLsO. EIm (&g« global end iwer), TTL is 
sintply set as desciibed bentinsboive. 

70 

Although the invrotion hus been described with respect to a tinnled number oremboOanents, 
il will be opprcciated (hat raraiy vprialioiu, isndificatioos aixl other applicaliuns of the 
invenlinn may be made, sod are Intended to be wilbio the scope of the i)iventi»w <a di9:k>xd 
herein. 

25 
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CLAIMS 

Whot is claimed Ik 

t . ^t^'stcm for cnching ohjecis loving dynamic content on the latcmet, comprising: 

a cucbc conncvicd m the loicruci foi :;to;in(i tuvi updatlog copies uf Uynsmic content. 

2. System, ftccMing to doiai 1, wljertin: 

(Ik: uAche is disposed at a location selected from the ffoap oon$i$tio$of nisjor Internet 
t.wilchiiig locftimu, dial>in aggi^^tion points, and corporate gslewsys. . 

3. .System, oceording to craim I , wlierein: 

(he cache a}so «tpns and updates copies of stslic conteal. 

4. bVsteio. ncconUng to claim 1» funber coniprising: 

oifims tot inoAiloriog one or nioK of die objects to ddemibie iipdato cliarRcteristics 
ihen»r; ood 

menns t'nr adju-sting a time tn live (TTL) peramdcr for the objects based upon tlic 
update cberacicristjcs. 

5. System, acconhng to cbuin 4, further con^msjng: 
nwRns for diitennining an age of on object; and 

meaj« for updating the object if the T71. for ihc obicct is ksa than its agp. 

C. System, accwdiog to claim 5, wherein: 

the ri L for each object is calculated ncconlinQ lo an average time uf changes for the 

obtjeet. 

7. System, according to claim 4, ftntbcr comprising: 

means for determining n prohobility of error for cstch of the objects; tmd 
meam for adjnsling (he TTL for each of the objects lo aiaintBiD its probability of 
error betow a predetomined emn-piobabili^ threshold. 

8. System, according loclaifflTf 'Wherein: 

the 'fTL for each object is calculated accwding to the equation: 




Whcrctii: 

T i 5 Ihc 'I fnie To Ul ve, for the dynamic ol^t; 

fy is a given time since the onginul object has changed (i.c^ how long it i$ 
ouldnlcd); 

1 is an average lime between changes, which, is determined &odi ibe object'.^ 
change siaiisticsr, and 

po is nixxiuium error prohitbilily. which is the avcrnge mtio of tiie number of 

reqiiesis fulfilled using o cached co|^ whose coxiespooding original object has 
changed for more th»n tl\t given time IK over the tvt;>I nwinbcr of icijncrt.s. 
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9. .System, aocnrding to claim 4, fiuthcr com|>rising: 

metns far detentiining on error rale for each of tb« objects; and 
nicBos for wljusling Trt fin each of the objects to mainruin its oror role below it 
predeteroiiQed error piubobilUy IhresholJ. 

10. System nccftrrliog to cJmid 9. whenan: 

Ihe TTL for each objcci is csicolatcd accordiog lo Ibc equation: 



\\bercin: 

T is (he Time To Live for the dyuamic object; 

fy is a gi-vcn tinK since ihc origb»] object hns changed (i c., how long it is 
outdated); 

T is BO overage line bd^vcn ch3nge9> which is delermlncO from the objects 
change stattslk^ 

n IS number ol'user requests per tiii>c unit (e.g.. frequency); and 

fit is maximum cnor rate, which is the overdge number per time Tnit of requests 

fulfilled imng « cached «9py whose conespoiidiiig or^inul object has dmnged 

for more than Ihc given thne W. 

1 1 . Syslctn. Qcrnrdmg to claiin 4, liirlhcr comprising: 

for determining a delay lime for each of ibe objects; and 
mcnns for adjuyting TTL for each of the (rf^tjccls to mftialain its defay lioK below a 
pteddemun^ delrv Ihreshuld. 

12. System iKcording to claim 1 1 . wherein: 

the TTT. for each object is calculated according to the eqtuUon: 



wherein: 

T is (he Time To Live for the dyronik objccl; 

t is an average time between changes^ which is dcterminod iraro the olgecf s 
change statistics; and 

1)0 Js maximum delay» wliicb is the ttvcrige lime between an object change and 
when the cached copy is refreshed. 

1 3. Syman, acrin^dlng to claim 4, further cpnTprisipg: 

means foi dclcnnining at least one object characlcristtc selected from the group 
consisting of error probability, error rate arid delay time for each of the objects; and 

means for at^usting TIL lo majtHohi the selected object charoctcrtics bdow a 
reqieclive threshold value. 




6A 
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14. STSlem, sccorJtng to claim I J, further comprblng: 

means for (imitiuR adnutincitt ot TTL for cdci} of the objects to a range bounded by 
predticnuincd tuimmuiTJ (7>nin) nnd mfwimum (7tn.'Vii)valiie& for'lTL. 

15. McOioU of rejpuniling to a u-ser tcqMsl for infoniiBtion Ibiviiig dynamic cunlcnl, 
comprishq;: 

slorinR El copy of ibc dynamic cnntenl In a cache; 
establishing a time to live (TTL) for tlic dynamic content; 
receiving the lucr rccpicst at the cache; 

responding to the user request with th« ^1o]«d cupy of (he dywflnie coOlent if its Tl'L 
b f^reater than ils »ge; and 

reuitviog on xn^nted copy of the dynamic contaij and rcjpondjnR to the 'user leqvesl 
with Ihc updated copy if ihc TTL of Oir stored copy is Jess ihan its ngt. 

16. Method, occordini; to claim 1 S, further contprisiRg: 
storing a copy of static content in ibe cache. 

1 7. • Method, aocartfing to daim 1 5, further GOiDfriaan^ 

dctermuimg an avcnttic update frcqooicy tor die dynunk content; owl 
detcnninng liie TTL fox the ityoagnc content as a functioo of its overace update 
ficquency. 

] S. Kleitind, according to claim I S, fiinhir comprisng: 

detenoining an avertige update freqoeni^ for the dj-Ktniic content; nnd 
deteiminint; the TTL for the dyamiic content as a functicm of lis axtsrago updata 

Cmiixatcy and a predetsimined error fiohahility threshold. 

19. Method, acoonlbBg to claim 18, Anther conqxisnis: 

adjusting the TTL for tin djvaniic cunleot acconlii^t to i Irequency of user requests 
for ihe dynajntc contait. 

20. Method, according to claim 1 5, wherein: 

the TIL for eaeh oigcct is cokulalcd according to the eq\i3ll<»i: 




whcictn: 

T is the Time To Live, for ihc d^-namlc object; 

is a given (imc since the original otjcvt has changed (i.ti.. how loog it is 
otitdaled); 

T is an average time between changes, which is dctcnnlncd frym ihe ohjccl's 
change fTtali sties; and 

n maxiratim en'or probability, which is the average ratio of the nimiher of 
requests fiiUilled ttsing a cached copy whose coirespondii^ orighifti oi^cct has 
changed for more than llie uiven time W, over the lota] immbet of requests. 



21. 



Method, accordiog to cleiut 15, wherein: 
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T is tbe Time To Liw fw the dynamic olqect; 

jy is R {pven time since (he originiil objecl has changed how long it is 

ootdiited); 

t is ftn average hiue between changes, which is tteleiroincd from the object's 
change stiOis^ics; 

n is nomhcr of uscf requests per time onil (e.g., frcqococy); and 
m is maximum error nitc, which is the average ounilxr ptt time unit of requests 
• fiitfilieU ushig 0 cached copy whose corTc.sp«n»lin(i originaj object has chnngetl 
for more than itie given time ?K 

Method, itccording In claim 1 5, wherein: 

the TTl, for cadi ohjeci is calculated according to the equation: 



1+3 



T is the Time To Liw for the dyrmmic Ol^ject; 

T is an average time between cbmges, which is dctetniined from the object's 

. change statistics; mid 
Do is niaxianun dett^y, whidi is lli« average tiute between an olgoct change and 
^'v{Ien the cached copy is refieKbed. 

23. Method, according to cintm I S. wherein the tnfbnnnlton is represented a web pogc, 
ibe method Jinlher comprising: 

a Grst time Ihe \isti ftcetves the web page, the web page f lanis a cookie in tbe usei** 
browser. 

24. Method, according to claim 23, i^^eia: 

every subscqocnt lime tho end iiscf request? the wsb page, the browser sends the 

coolcic. 

25. Method, occording to cjain) 24, further comprising: 

die cache looks at the cookie and sec if the requested web page is a persoiwUxcd page 
of a single end user. 

26. Method, acconling lo rlnitn 23. further cuinpf iaing: 
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if ibe requtsied wd> fogc is n jiersonafized wtb pase for a single eod user, iben tht 
web page is not cached. 

27. Meihiyl, nccoiJmg to claim \ 5, further comprining: 

if the infonnulion 19 suppoftii to be inodiflcd «icb time it is accessed, ssM'mg TTL - 

0. 



28. Metbod of responding to a user rcquKtl for un objcci baving dynaznic content, snid 
object originating from a server. compHsiog: 

storing a copy of the objcci tn u cncbe; 
esbiblishiog a tine to live (TTT^) for the object; 
receiving ihc user reipicsl at the cache; 

fulfiiling the user request with (ho stored copy of Ibe object if its TVL is greater than 
its oge: njvd 

fetching an ujjdatcd copy of the object boJ responding to ihc user le^ucst wilb Che 
updnitnJ copy if the TTL of the stored copy is Icsi than its ;»ge. 

29. Method, aocoidiog to c)siin28f&rthercaiDprisitig,i(l the cache: 

first settiDg the lit for the objet:t to a rcAsooaiMe lower lUtat (Ttnin); sod 
adjuEting the TTL fus the objed based on the troqitcAcy at which the ol^cct actually 
changes' 

30. Method, accoidiag to dmm 2Z. further comprising: 

each tbne the cache fetches the object frnm the server, performing the follo%vtng 
pFOceUures: 

a. if auothcr fetch for the .w»c object k onguing, woiling for ihe previous fetch to 
ooinplcte; 

b. fetching the object from Ihe server. 

c repjacing the cached copy, if present, by the fetched object, nflcr having compare J 
them to detennine whether tlie object had ch:}ngcd since it was last fetched: 
d. initialmog or updnting the object's chnnge stol isiics .iccordingly; 

c. maildng the object as static or dynamii: coolenf rtcpe«ding on the server's reply; and 
f. if the object is dynaroic, setting its TTL (r) to an appmpriale value with respect to 

an overage time between changes (t) delernuned from the object's chaugc statistics, the 
number of user requests per time unit (n) detennined from the ohgccfs access statistics, 
one of the rollo\Ying procedures (A-E): 

A. maximum error i^obabiliiy; 

U. onximuin error rate; 
' CX UHUuinwD delay; 

D. luiy combination of the above <A,B,CO, taking ihe lo^vest T; or 

E. any coniblnaiioo of the abo^'e (A.D,C), but keeping 7* within a predetermined 
window of "jreasooabie'' vah les bounded hy Tmiti and Ttfia\. 

3T . Method, accoidtng to ctattu 28, wherein the inibfmation is represented as a web page, 
the method ftttthcr comprising: 

a first tine the user receives the web page» the web page plnnts a cookio in the user's 
hrowsei. 



32. Method, according to claim 3 1 , wherein: 
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evciy GubseqiiBol time the end user requests the web page, the bnnvser sends the 

cookie. 

33. Method, according to claim 32, furttier comprising: 

the cache looks at U>o cookie mwl sec irilic requested web pogc is n personalized jttgt 
of n riugic end iiser. 

34. Method, according (o ciaitn 3 ) . Turtha cumprinne: 

if the rcqwstnl web page it a peraonnli/ed web page far a nngle end user, then Ike 
web foffi is not cacbed. 

35. Method, accofdine to clnim further coirtprtsing: 

if the infomiBtion \s siippa'tcd lo be modifie<) each time it is Accessed, selling YIX, 0. 
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